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MEMORY COST OF QUANTUM CONTEXTUALITY 
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Abstract. The simulation of quantum effects requires certain classical re- 
sources, and quantifying them is an important step in order to characterize 
the difference between quantum and classical physics. For a simulation of the 
phenomenon of state-independent quantum contextuality, we show that the 
minimal amount of memory used by the simulation is the critical resource. 
We derive optimal simulation strategies for important cases and prove that 
reproducing the results of sequential measurements on a two-qubit system re- 
quires more memory than the information carrying capacity of the system. 



1. Introduction 



i-Q ' According to quantum mechanics, the result of a measurement may depend 

^]p, on which other compatible observables are measured simultaneously [l|43[. This 

■y ' property is called contextuality and is in contrast to classical physics, where the 

t^ , answer to a single question does not depend on which other compatible questions 

^ ' are asked at the same time. 

O^. Contextuality can be seen as complementary to the well known nonlocality of 

distributed quantum systems [J|. Both phenomena can be used for information 
processing tasks, albeit the applications of contextuality are far less explored [5|- 



^ ' |12[. Although contextuality and nonlocality can be considered as signatures of 

^D , nonclassicality, they can be simulated by classical models (3l ll3H15l |. However, while 

en 



nonlocal classical models violate a fundamental physical principle (the bounded 
speed of information), it is not clear whether contextual classical models violate any 
fundamental principle. Moreover, while the resources needed in order to imitate 
Li^ , quantum nonlocality have been extensively investigated |16l4l8|. there is no similar 



f^ ' knowledge about the resources needed to simulate quantum contextuality. 

In any model which exhibits contextuality in sequential measurements, the sys- 
tem will eventually attain different internal states during certain measurement se- 
quences. These states can be considered as memory — a model attaining the 

p\^ • minimal number of states is then memory-optimal and defines the memory cost. 

M , In this paper we investigate the memory cost as the critical resource in a classical 

simulation of quantum contextuality and we construct memory-optimal models for 
relevant cases. The amount of required memory increases as we consider more and 
more contextuality constraints. This can be used to quantify contextuality in a 
given quantum setting. We show that certain scenarios breach the amount of two 
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bits needed for the simulation of two qubits. This demonstrates that the memory 
needed to simulate only a small set of measurements on a quantum system may 
exceed the information that can be transmitted using this system (given by the 
Holevo bound [l9|) — a similar effect was observed so far only for the classical 
simulation of a unitary evolution [20[ . 



We focus on the following 
Mermin (PM) square 



Uowing 



2. Scenario 
set of two-qubit observables, also known as the Peres- 



B C cTz®! 1 (g) CTz az®(Jz 

a b c — \®ax (Jx®'^ (Jx®(^x ^ (1) 

a P ^\ \jJz ®(Jx (Jx® Oz cTy ® ay 

where ax, ay, and az denote the Pauli operators. The square is constructed such 
that the observables within each row and column commute and are hence compat- 
ible, and the product of the operators in a row or column yields 1, except for the 
last column where it yields —1. Thus the product of the measurement results for 
each row and column will be +1 except in the third column where it will be —1. 
In contrast, for a noncontextual model the measurement result for each observable 
must not depend on whether the observable is measured in the column or row con- 
text. Hence the number of rows and columns yielding a product of —1 is always 
even, as any observable appears twice. 

Similar to the Bell inequalities for local models, any noncontextual model satisfies 
the inequality 

(x) = {ABC) + {abc) + {al3-f) + {Aaa) + {Bhp) - {Cc-f) < 4, (2) 

while for perfect observables quantum mechanics (QM) predicts (x) = 6 |22l |. Here, 
the term (ABC) denotes the average value of the product of the outcomes of A, 
B, and C, if these observables are measured simultaneously or in sequence on the 
same quantum system. The violation is independent of the quantum state, which 
emphasizes that the phenomenon is a property of the set of observables rather than 
of a particular quantum state. 

Recently, this inequality has been experimentally tested using trapped ions |23[ , 
photons ^] , and nuclear magnetic resonance systems 25| . The results show a good 
agreement with the quantum predictions. In these experiments, the observables are 
measured in a sequential manner. Since the observed results cannot be explained 
by a model using only preassigned values, the system necessarily attains different 
states during some particular sequences, i.e., the system memorizes previous events. 
(Note that also in QM the system attains different states during the measurement 
sequences.) This leads to our central question: How much memory is required in 
order to simulate quantum contextuality? 

3. A FIRST MODEL 

Before we formulate the previous question more precisely, let us provide an ex- 
ample of a model that simulates the contextuality in the PM square. We assume 
that the system can only attain 3 different physical states 5*1, 5*2, and S3 (e.g. 
discrete points in phase-space). Let us associate a table to each state via 



Si: 



(+,2)- 
(+,3) 



^2 



(+,1) 



& 



(+,1) 
(-,2) 



(3) 



- (-,3) - 

Those tables define the model's behavior in the following way: If e.g. the system 
is in state Si and we measure the observable 7, consider the first table at the 
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position of 7 (i.e., the last entry in the third row). The + sign at this position 
indicates that the measurement resuh wih be +1, while the system stays in state 
Si. If we continue and measure C, we encounter the entry (+,2) which indicates 
the measurement result +1 and a subsequent change to the state ^2. Being in state 
S2, the second table defines the behavior for the next measurement: For instance 
a measurement of c yields now the result —1 and the system stays in state 82- 

Thus, starting in state Si, the measurement results for the sequence jCc are 
+1, +1, —1, so that the product is —1 in accordance with the quantum prediction. 
It is straightforward to verify that this model yields (x) = 6. In addition, the 
observables within each context are compatible in the sense that in sequences of 
the form AA, ABA, or AaaA, the first and last measurement of A yields the same 
output. In fact this particular model is memory- optimal (cf. Theorem [T|) and we 
assign the symbol ^3 to it. 

4. Memory cost of classical models 

Any model that reproduces contextuality eventually predicts that the system 
attains different states during some measurement sequences. As an omniscient 
observer one would know the state prior to each measurement and could include it 
in the measurement record. Thus, knowing the state of the system, one can predict 
the measurement outcome as well as the state of the system that will occur prior to 
the next measurement. Thus we can write any model that explains the outcomes 
of sequential measurements in the same fashion as we did for ^3 . 

Taking a different point of view, any such model can be considered to be an 
automaton with finitely many internal states, taking inputs (measurement settings) 
and yielding outputs (measurement results). In our notation, the output depends 
not only on the internal state, but also on the input. Such automatons are known 
as Mealy machines [26J, |27[ . 

The quantum predictions add restrictions on such an automaton and thus in- 
crease the number of internal states needed. As a simple example we could require 
that an automaton reproduces the quantum predictions from the rows and columns 
in the PM square. That is, for all sequences in the set 

=Sic = {ABC, abc, aj3^, Aaa, Bb^, Ccy, and permutations}, (4) 

we require that the automaton must yield an output that matches the quantum 
prediction. For example, QM predicts for the sequence ABC that the output is 
either +1,-1-1,-1-1 or one of the permutations of -|-1, — 1, — 1. 

More generally, if ^ denotes a set of measurement sequences, we say that an 
automaton A obeys the set ^ if the output for any sequence in £S matches the 
quantum prediction — i.e., if for any sequence in J2, the output of A could have 
occurred with a nonvanishing probability according to the quantum scenario. We 
say that a sequence yields a contradiction if the output of this sequence cannot 
occur according to QM. Hereby we consider all quantum predictions from any 
initial state (it would also suffice to only consider the completely mixed state g = 
1/ tr[l]). Furthermore, we assume that prior to the measurement of a sequence, the 
automaton always is re-initialized. This ensures that the output of the automaton 
is independent of any action prior to the selected measurement sequence. Note that 
we only consider the certain quantum predictions which occur with a probability 
of 1, while e.g. in the PM square we do not require that for the sequence A7 the 
probability of obtaining -1-1, -1-1 is equal to the probability of obtaining -1-1, —1. 
Finally, if an automaton with k states Si, . . . , Sk obeys ^ and there exists no 
automaton with less states obeying ^, we define the memory cost of ^ to be 
M(^)=log2(A:). 
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5. CONTEXTUALITY CONDITIONS 

Our definition of memory cost so far applies to arbitrary situations, even those 
in which contextuality does not directly play a role. In contrast, contextuality 
of sequential measurements corresponds to the particular feature, that certain se- 
quences of mutually compatible observables cannot be explained by a model with 
preassigned values (cf. Ref. [28| for a detailed discussion) . The contextuality condi- 
tions for observables Xi , X2 , ■ ■ ■ thus arise from the set of all sequences of mutually 
compatible observables, 

"Scontoxt = {-'^i-'^2 ■ ■ ■ \ Xi mutually compatible}. (5) 

If the choice of observables Xi, X2, . . . exhibits contextuality, then M(=Scontcxt) > 0. 
— In the case of the PM square, .Scontoxt surely contains all the row and column 
sequences that we included in JS^c- In addition, however, =Scontcxt contains e.g. the 
sequences AA, ABA, and AaaA, for which QM predicts with certainty a repetition 
of the value of A in the first and last instance. Note, that the set =Scontext also 
contains more complicated sequences like AC ABC A for which QM predicts with 
certainty that the values of A (C) in the first, third and sixth (second and fifth) 
measurement coincide, and that product of the outcome for ABC yields -1-1. 

A particular feature of contextuality is that one can find observables that ex- 
hibit contextuality independent of the actual preparation (the initial state) of the 
quantum system. Consequently, one may consider an extended preparation proce- 
dure of the automaton, where the experimenter performs additional measurements 
between the initialization of the automaton and the actual sequence. The experi- 
menter would e.g. measure the sequence bABC but consider the measurement of the 
observable b to be actually part of the preparation procedure. We write [b\ABC 
for a sequence where we are not interested in the result of b. If ^aii denotes the set 
of all sequences with observables Xi , X2 , . . . , we write 

^context = { [TJ ^ I ^ e i2contoxt , T € i2au} (6) 

for the set of all sequences in i?contcxt, including arbitrary preparation procedures. 
For the contextuality in the PM square, the automaton A3 obeys ^context, while 
no automaton with less than 3 states can obey .^context, cf. Appendix El for details. 
We did not specify an initial state for ^3 and indeed the contextuality conditions 
are obeyed for any initial state. We summarize: 

Theorem 1. The memory cost for the contextuality correlations ^context *^ ^^^ P^ 
square is log2(3) « 1.58 bits; M{^'^^^^^^^) = M(^contoxt) = log2(3). Consequently, 
the automaton A3 is memory- optimal. 

6. Compatibility conditions 

The set =Scontoxt contains all sequences of mutually compatible observables, but 
does not contain sequences like ABaA, for which QM also predicts that both occur- 
ring values of A are the same. Sequences of this form enforce that all observables 
compatible with an observable Y must not change the measurement result of Y . 
This can be covered by the set of all compatibility conditions 

^compat = {YIX1X2 ...\Y\Xt> compatible to Y}, (7) 

and a convincing test of contextuality must also test the correlations due to this set 
of sequences. Again we define ^compat to include arbitrary preparation procedures. 
The automaton A3 does not obey ^compat, since e.g. starting with state Si, 
the sequence B[Cl3\B yields the record -1-1, [-1-1,— 1, J— 1 and hence violates the 
assumption of compatibility; similar sequences can be found for any initial state. We 
show in Appendix [D] that no automaton with three states can obey simultaneously 
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^compat and ^context and hence M{^'^^^^p^^ and ^context) > 2. However, there 
exist automata with four internal states which obey ^compat and £2'^^^^^^^^. As an 
example of such an automaton we define A4 via 



^1: 



S: 



3- 



(+,2)- 
(+,3) 



(+,1) (-,4) + 



, S2 



04 '. 



.(-,4) (+a) + 

■ + - (-,3)- 

+ (-,2) 



(8) 



Similar to the situation for ^3 , the initial state for the automaton A4 can be chosen 
freely; we refer to Appendix [B] for details. So we have: 



Theorem 2. The memory cost of for the contextuality and compatibility correla- 
tions in the PM square is two bits; y[{^'^^^ ^ and =S^ontcxt) ~ 2. Consequently, 
the automaton Ai is memory- optimal. 



7. Extended Peres-Mermin square 

There are, however, further contextuality effects for two qubits which then re- 
quire more than two bits for a simulation. Namely, in Ref. [29| an extension of 
the PM square has been introduced, involving 15 different observables in 15 dif- 
ferent contexts. The argument goes as follows: Consider the 15 observables of the 
type (T^ ® cr,y where /i, z^ G {0,x,y,z} and ctq = 1 and the case /j, = ;/ = is 
excluded. In this set there are 12 trios of mutually compatible observables such 
that the product of their results is always -|-1, like [ax (8) 1, 1 (8> <Jy,ax (8> o-y] and 
[ax (El ay, ay (El ax, a z (El a z], and three trios of mutually compatible observables such 
that the product of their results is always —1, like [ax <Ei ay,ay (E az,az (E ax]- This 
leads to 15 contexts in total. Similar to the usual PM square, one can derive an 
state independent inequality. For this inequality, QM predicts a value of 15 for the 
total sum, while noncontextual models have a maximal value of 9, cf. Ref. [29| and 
Appendix lEl for details. 

One may argue that this new contextuality argument is stronger than the usual 
PM square [29| . Does a simulation of it require more memory than the original PM 
square? Indeed, this is the case: 

Theorem 3. The memory cost for the contextuality and compatibility correlations 
in the extended version of the PM square is strictly larger than two bits. 

More precisely, according to Eqns. ([5]) and (O we define the contextuality se- 

for the 15 observables 



quences ^compat,i5 and compatibility sequences JS'^ 



coinpat,15 



context, 15 

in the extension of the PM square. Then, the theorem states that M(=S^ 
and ^context, 15) > 2. The proof is based on the following idea: If one considers the 
15 contexts in the extended square then they can be arranged in a collection of 10 
distinct squares, each similar to the usual PM square. The contextuality in this 
arrangement is strong enough such that for each fixed assignment of the output, 
one must have three contradictions for one of the 10 usual PM squares. One can 
show, however, that any four-state solution obeying .S^oj^tg^t and =S^ompat is similar 
to A4, in which for no fixed state one has three contradictions. The full proof is 
given in Appendix [E] 

Although in this paper we are mainly concerned about the memory cost of con- 
textuality, we mention that the simulation of all certain quantum predictions of the 
PM square already requires more than two bits of memory. In fact, any four-state 
automaton that obeys =S^o„,pjjt and =S^o„t(,xt is up to some symmetries of the form 
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of ^4 (cf. Appendix lEl Proposition [5]), but for A4 the sequence [/3Ja6[CJc yields a 
contradiction. This proves that M(^aii) > 2. 

On the other hand, QM itself suggests an automaton for simulating contextuality. 
If e.g. we choose the pure state |0}(0| defined by A\<j)) — B\(f>) — {cf)) as initial state, 
then this state and all the states occurring during measurement sequences define 
an (nondeterministic) automaton. By a straightforward calculation one finds that 
this automaton attains 24 different states if we consider the set of all sequences 
=Saii. By a suitable elimination of the nondeterminism, we can readily reduce the 
number of states to 10 (cf. Appendix |F]) yielding an upper bound on the required 
memory and hence 2 < M(^aii) < logs (10) ~ 3.32. 

8. Conclusions 

We have investigated the amount of memory needed in order to simulate quan- 
tum contextuality in sequential measurements. We determined the memory-optimal 
automata for important cases and have proven that the simulation of contextuality 
phenomena for two qubits requires more than two classical bits of memory. How- 
ever, the maximal amount of classical information that can be stored and retrieved 
in two qubits is well known to be limited to two bits (l9| . This implies that any 
classical model of such a system either would allow to store and retrieve more than 
two bits or would have inaccessible degrees of freedom. (An example of the lat- 
ter is A3, since one cannot perfectly infer the initial state from the results of any 
measurement sequence.) 

It should be emphasized that our analysis is about the memory that is needed 
to classically simulate the certain predictions from measurement sequences on a 
quantum system. In contrast, one may ask how many different states are needed 
to merely explain the observed expectation values J50-35J. However, the number of 
states needed in this scenario measures the number of different initial configurations 
of the system, while we have shown that even for a fixed initial configuration, 
the system must eventually attain a certain number of states during measurement 
sequences. Similarly, it has been demonstrated that a hybrid system of one qubit 
and one classical bit of memory is on average superior to a classical system having 
only access to a single bit of memory [36] - while we show in Theorem [U that 
for a two-qubit system, even the certain predictions cannot be simulated with one 
classical bit of memory. 

Our work provides a link between information theoretical concepts on the one 
side and quantum contextuality and the Kochen-Specker theorem on the other side. 
While for Bell's theorem such connections are well explored and have given deep 
insights into QM [iSl 1371 . |38| , for contextuality many questions remain open: If 
an experiment violates some noncontextuality inequality up to a certain degree, 
but not maximally, what memory is required to simulate this behavior? Can non- 
deterministic machines help to simulate contextuality? What amount of memory 
and randomness is required to simulate all quantum effects in the PM square, es- 
pecially in the distributed setting 113]? Finally, for quantum non- locality it has 
been extensively investigated why QM does not exhibit the maximal non-locality 
[371 1381 ]. A similar situation occurs for quantum contextuality — can concepts 
from information theory also help to understand the nonmaximal violation in this 
situation? 
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Appendix A. ^3 is optimal 

We already defined the set of row and column sequences ^^c in Eq. (|3]) . Another 
natural constraint is given by the set of repeated measurements 

^repeat = {AA, BB, CC, aa, bb, cc, aa, /3/3, 77}, (9) 

where we expect for any of these pairs that the results in the first and the second 
measurement coincide. Both sets i^ic and =Sicpcat obviously are subsets of the set 
of contextuality sequences ^context of the PM square. Nevertheless, an automaton 
that simultaneously obeys i?rc and i?ropoat already possesses more than two internal 
states, i.e. M(i2i.c and i?ropeat) > 1- In order to see this, assume that the automaton 
has only two internal states and without loss of generality that it starts in state 
Si . We consider the case where in the last column there must be a prescribed state 
change in order to avoid a contradiction, i.e., in 5*1 the product of the assignments 
of Cc7 is +1, contrary to the quantum prediction. Note that there always exists 
at least one row or column with such a contradiction, and that the proof for any 
row or column follows the same lines. If there is only one state change (say after a 
measurement of 7), then, while measuring the sequence Cc7, the automaton would 
remain in Si until after the last output and therefore yield a contradiction. If there 
are two (or more) state changes in the last column (say c and 7), both must go 
to 5*2. Then, the constraints from i?rcpcat require that 7 has the same values in 
Si and S2 (this is also true for c). But then the sequence Cc7 in i^rc will yield a 
contradiction. Thus a two-state automaton cannot obey both, i^^c and ^repeat- 

On the other hand, ^3 is an example of a 3 state automaton, which obeys ^rc 
and ^repeat- In fact ^^3 obeys =S^ontoxf In order to see this, it is enough to show, 
that for any choice of the initial state, the automaton will obey =Scontoxt- So we 
assume that Si is the initial state; the reasoning for S2 and 5*3 is similar. If we 
now measure a sequence with observables from the first row only, we may jump 
between the states 5*1 and S2 , but the output for all observables in the first row are 
the same for either state. A similar argument holds for all rows and the first and 
second column. For a sequence with measurements from the third column, assume 
that the first observable in the sequence, that is not 7, is the observable c. Then 
the state changes to 6*3, in which the last column does not yield a contradiction. 
Since only the output C was changed, but C was not measured so far, we cannot 
get any contradiction. A similar argument can be used for the case where the first 
observable in the sequence, that is not 7, is the observable C. 

In summary, since any automaton that obeys ^context has at least 3 states and 
^3 is a 3 state automaton obeying the larger set ^context) we have shown that A^ 
is memory-optimal for either set. 

Appendix B. Aa obeys ^^ontcxt and ^^^^p^t 

In this Appendix we demonstrate that the automaton Aa indeed obeys ^^q^^j,^^ 
and ^compat ■ The proof for i^contcxt i^ completely analogous to the one in Appen- 
dix E] 

For ^compat; we cousidcr a fixed observable, e.g. B. Then Si and 5*2 yield -f 1 
while S2, and ^4 give —1. However, using arbitrary measurements compatible to 
B, (i.e., A, B, C, b, and /3) we can never reach S3 or 5*4 if we start from ^i or 
S2 and vice versa. Hence no contradiction occurs for any sequence of the type 
lT\B[XiX2 ■ ■ -IB. A similar argument holds for all observables, if we note in 
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addition that e.g. after a measurement of C the automaton can only be in ^2 or 

Appendix C. Definitions and basic rules used in the optimality proofs 

As we already did in the main text, we denote the observables from the PM 
square by 

'A B C 
a b c . (10) 

a /? 7 

Furthermore, we denote the rows of the square by Ri, and the columns by Cj. The 
value table of each memory state i is denoted by Ti and the update table by Ui. 
We write an entry of zero in [7^, if the state does not change for that observable. 
Furthermore, we write measurement sequences as A^ B2 C2 a^ meaning that when 
the sequence ABC a was measured, the results were +,—,—,+, and the memory 
was initially in state Si and changed like Si ^^ S2 ^^ S2 ^^ Sy,. 

It will be useful for our later discussion to note some rules about the structure 
of the value and update tables. 

1. Sign flips: Let us assume that we have an automaton obeying ^context ^ii^i 
■^compat (o'' some subset of those sets) and pick a 2 x 2 square of observables 
(e.g., the set {A, B, a, b} or {A, B, a, /3} or {A, C, a, 7}). Then, if we flip in 
each Ti the signs corresponding to these observables, we will obtain another 
valid automaton. 

This holds true, because the mentioned sign flips do not change any of 
the certain quantum predictions from ^S^q^^^^^j or =S^ompaf This rule will 
allow us later to fix one or two entries in a given value table Tj. 

2. Number of contradictions: Any table Ti contains either one, three, or five 
contradictions to the row and column constraints. 

This follows directly from that fact that any fixed assignment fulfills 
Y[j^. RfeCfe = +1, while the row and column constraints require J^^, RfeCj; = 
-1. 

3. Condition for fixing the memory: Let us assume that we have an automaton 
obeying ^context ^^'^ l^t there be a table Ti which assigns to an observable 
(say A) a value different from all other tables. Then, the update table Ui 
must contain only zeroes in the corresponding row and column (here, Ri 
and Ci). 

The observables in the row and column correspond to compatible ob- 
servables, which are not allowed to change the value of the first observable. 
However, any change of the memory state would change the value, as Ti is 
the only table with the initial assignment. 

4. Contradictions and transformations: Let us assume that we have an au- 
tomaton obeying ^context ^^'^ ^^t there be in Ti some contradiction in the 
column Cj (or the row Rj). Then, in the update table Ui there cannot be 
two zeroes in the the column Cj (or the row Rj). 

If there were two zeroes, it could happen that one measures two entries of 
Cj without changing the memory state. But then measuring the third one 
will reveal the contradiction in Ti . (Note that the automaton first provides 
the result and then updates its state.) 

5. Contradictions and other tables: Let us assume that we have an automaton 
obeying i^^Q^^jj^t ^^d let there be in Ti a contradiction in the column Cj 
(or the row Rj). Then, there must be two different tables Tk and Ti where 
in both the column Cj has no contradictions anymore, but the assignments 
of Tk and Ti differ in two observables of Cj. Furthermore, in the column 
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Cj of the update table Ui there must be two entries leading to two different 
states. 

First, note that there must be at least one other table Tk where the 
contradiction does not exist anymore. This follows from the fact that we 
may measure Cj starting from the memory state i. After having made 
these measurements, we arrive at some state fc, and from the contextuality 
correlations ^context it follows that Cj in Tk has no contradiction. 

The table Tk differs from Ti in at least one observable X in Cj . On the 
other hand, starting from Ti one might measure X as a first observable. 
Then, making further measurements on Cj one must arrive at a table Ti 
without a contradiction. Since T^ and Ti have both no contradiction, they 
must differ in at least two places, one of them being X. Finally, if the 
column Cj in Ui would only have entries of zero and fc, then Cj in Tj. 
could not differ from Ti . This eventually leads to a contradiction and hence 
proves the last assertion. 



Appendix D. A4, is memory-optimal 
Here, we proof the optimality of the 4-state automaton A4, in the sense of 



obeying ^'^^ 



and ^' 



compat 



with a minimum number of states. We use the 



definitions and rules as introduced in Appendix [C] 

Let us assume that we would have a three-state automaton obeying ^context- ^1 
has a contradiction, and we can assume, without loss of generality, that it is C3. 
Then, according to Rule 1 we can, without loss of generality, assume that all entries 
in C3 are "-I-". Together with Rule 5 this leads to the conclusion that the three 
states Ti are, without loss of generality, of the form: 



Ti: 



Ui: 



To 



+ 



n 















2 


U2: 





c/3: 





3_ 













(11a) 



(lib) 



Here, empty places in the tables mean that the corresponding entries are not yet 
fixed. The table Ui follows from Rule 5, and f/2 and U3 follow from Rule 3. 

Which can be the entries corresponding to the observables a and & in [72? Since 
T3 assigns a different value to c than T2, there cannot be a "3" at these entries, 
otherwise, a sequence like €20.20'^ would lead to a contradiction to the conditions 

01 =-!^context • 

But there can also not be a "1" at these entries, because then the sequence 
'^t'^2li'^3 yields a contradiction to ^compat, since a and 7 are compatible with 
c. So the entries of R2 in U2 must be zero, as there are only three states in the 
memory. A similar argument can be applied to C/3, showing that here R3 must be 
zero. 

So the tables have to be of the form 



Ti: 



Ui: 



T2 



U2 



+' 


+ 





-J 





5 








n 



c/3 



+' 


0' 


f 













(12a) 



(12b) 
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Now, according to Rule 4, the contradictions in T2 as well as in T^ can only be in 
Ri. But, according to Rule 5, if there is a contradiction in Ri of T2, there must 
be two different Ti and Tj where there is no contradiction in Ri . But there is only 
one table left, namely Ti, and we arrive at a contradiction. 

Appendix E. Proof that the classical simulation of the extended PM 
square requires more than two bits of memory 

Let us now discuss the extended PM square from Ref. [29j. Again, we refer to 
Appendix [C] for basic definitions and rules. As already mentioned, one considers 
for that the array of observables 





Xoi 


X02 


Xio 


Xii 


X12 


X20 


X21 


X22 


X30 


X31 


X32 



1® (JX 1® ay 1 (8) (Tz 



Gy ®\ ay (g) ax ay (g) ay ay® az 
CTz (8) 1 az® ax az® a^ az ® az 



(13) 



X03 
Xl3 
X23 
X33. 

These observables can be grouped into trios, in which the observables commute and 
their product equals ±1. Nine trios are of the form {xfco, Xfci, Xoi}, three trios where 
the product equals +1 are {xii, X23,X32}, {xi2,X2i,X33}, and {xi3, X22,X3i}- 
Three trios where the product equals -1 are {xii, X22,X33}, {xi2,X23, X3i}, and 
{Xi3, X2i,X32}- From this, one can derive the inequality 

Efc.i (XfeOXfciXO/) + (X11X23X32) + (Xl2X2lX33) + 

+ (Xl3X22X3l) - (X11X22X33) - (Xl2X23X3l)- 

- (X13X21X32) < 9 (14) 

for noncontextual models, while QM predicts a value of 15, independently of the 
state. 

First note that in this new inequality 15 terms (or contexts) occur but any 
noncontextual model can fulfill the quantum prediction for only 12 of them at 
most, so three contradictions cannot be avoided. One can directly check that in 
the whole construction of the inequality, 10 different PM squares occur. Nine of 
them are a simple rewriting of the usual PM square, while the tenth comes from 
the observables Xki with k^l ^ 0. Any of the 15 terms in the inequality contributes 
to four of these PM squares. 

Any value table for the 15 observables leads to assignments to the 15 contexts, 
but it has at least three contradictions. As any context contributes to four PM 
squares, this would lead to 12 contradictions in the 60 contexts of the 10 PM 
squares, if we consider them separately. Since in a PM square the number of 
contradictions cannot be two (Rule 2), this means that one of the PM squares has 
to have three contradictions. 

Let us now assume that we have a valid automaton for this extended PM square 
with four memory states. Of course, this would immediately give a valid four-state 
automaton of any of the 10 PM squares. For one of these PM squares, at least one 
table has to have three contradictions. So it suffices to prove the following Lemma: 

Lemma 4. There is no four-state automaton obeying ■^compat ^'^'^ -^'context' where 
one table Ti has three contradictions. 

In course of proving this Lemma we will also prove the following: 

Proposition 5. The four-state automaton Ai is unique, up to some permutation 
or sign changes. 

To prove the Lemma, we proceed in the following way: Without loss of generality, 
we can assume that the first three tables Ti look like the Tj in Eq. (|llap . Then, 
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we can add a fourth table T4. For the last column of this table, there are 2^^ = 8 
possible values. We will investigate all eight possibilities and show that either 
we arrive directly at a contradiction, or that only an automaton similar to A^ is 
possible, in which any table has only one contradiction. This proves the Lemma. 

We will first deal with the four cases, where T4 has also a contradiction in C3. 
This will lead to Observation [SJ which will be useful in the following four cases. 



Case 1: For T4 one has [C, c, 7] = [+ + +]; 

In this case, a simple application of the previous rules implies that several entries 

are fixed: 



T 



(15) 



U 



~ 




0' 




0' 






2 












, 


2 


3 














3 



(16) 



Here and in the following, we write the Ti and Ui just as a row for notational 
simplicity, starting from Ti to T4. The entries in Ui and C/4 are fixed from the 
following reasoning: Let us assume that one measures c in Ti, then, since the 
values C{Ti) are the same in all Ti, one has to change immediately to a table 
with no contradiction in C3 , and where the value of c is still the same. The only 
possibility is T2. Furthermore, R2 in U2 and R3 in t/3 must be zero due to the same 
argument which led to Eq. (J12bp . 

It follows (Rule 4) that T2 and T^ have both exactly one contradiction, which 
must be in Ri. So, in Ri of U2 there must be the entries "1" and "4" [an entry 
"3" would not solve the problem, because in Ri(r3) has also a contradiction]. As 
we can still permute the first and second column, we can without loss of generality 
assume that the first row in U2 is [1 4 0]. Due to Rule 1, we can also assume, 
without loss of generality, that A{T2) — +■ Similarly, in Ri(f73) there must be the 
entries "1" and "4" , resulting in two different cases: 

If Ri(C/3) = [1 4 0], we must have the following tables. 



T 



(17) 



U 



■ 




"1 4 0' 




"1 4 0' 




" 


2 


7 












2 


3 














3 



(18) 



where the added values in Ri of the Ti follow from Ri(C/2) and Ri([/3). 

Now, if we start from T2 and measure the sequence 02^201, we see that we 
must have a{Ti) = a{T2). Similarly, from T^ we can measure a3^3ai, implying 
that a(Ti) = a(T2) = a{Tz). Similarly, we find that b{T2) = b{T3) = b{Ti). But 
this gives a contradiction: In R2(T'2) and R2(T'3) there is no contradiction and 
c{T2) 7^ c(r3). Therefore, it cannot be that a(T2) = a{T3) and at the same time 

h{T2) = bin). 

As the second case, we have to consider the possibility that Ri(C/3) — [4 10]. 
Then, also the values of Ri(T3) must be interchanged, Ri(T3) = [ — h +]. Then, 
starting from T2, the sequence Q;2^27ia3 shows directly that a{T2) = aiT^). Simi- 
larly, starting from T3, the sequence 03^30402 shows that a(T2) = aiT^). But since 
^(^2) 7^ ^{T^), this is a contradiction. 
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Case 2: For T^ one has [C, c, 7] = [H ]; 

As in Case 1, one can directly see that several entries are fixed: 



T 



U 







0' 




0' 






2 


7 












3 


3 














2 



(19) 



(20) 



The zeroes in U2 and U3 come from the following argumentation: Starting from Ti, 
the measurement sequence c{X2C-! with X compatible to c shows that in R2(f72) 
and Cz{U2) there can be no "3" or "4". But there can be also no "1", because 
then the sequence CiX2'jic^ would lead to a contradiction. Therefore, R2(C/2) and 
^3(112) have to be zero. Starting from T4 and measuring 7 one can similarly prove 
that the entries for R3(J72) have to be zero and analogous arguments prove also the 
zeroes in Us. 

It is now clear (Rule 4) that the contradictions in T2 and T3 have to be in Ri 
and the missing entries in U2 and C/3 can only be "4" and "1". As we still can 
permute the first and second column, there are only two possibilities: 

Case 2A: First, we consider the case that Ri(C/2) = Ri(C/3) = [14 0]. As in 
Case 1, we can directly see that a(T2) = a(Ti) = a(T3) and 6(T2) = 6(T4) = biTs). 
Hence, R2(T2) and R2(T3) differ exactly in the value of c, but in both cases there 
is no contradiction in R2. This is not possible. 

Case 2B: Second, we consider the case that the first rows of C/2 and C/3 differ, 
and we take Ki{U2) = [14 0] and Ri(C/3) = [410]. Then, we apply Rule 1 to fix 
for A{T3) = a{Ts) = +. Then, the tables have to be: 



T 



U 



' 




"1 4 0' 




"4 1 0' 






2 







: 





, 


3 


3 














2 



(21) 



(22) 



Here, C2(Ti) and Ci{T4) come from measurement sequences like a'^A^a'J, starting 
from T3. 

Again, we have two possibilities for the value of 6 in 72- If we set b(T2) = — , 
then all values in all Ti are fixed and each table has exactly one contradiction. This 
is, up to some relabeling, the four-state automaton A4 from the main text (indeed, 
this is the way how this solution was found). If we set fe(T2) = +, then also all Ti 
can be filled, and we must have: 



T 



U 







"1 4 0' 




"4 1 0' 






3 2 


7 





, 





, 


2 3 


2 3_ 














3 2_ 



(23) 



(24) 



Here, the tables Ti and T4 have three contradictions (two new ones in R2 and R3) 
and the new entries in Ui and f/4 must be introduced according to Rule 5 [note 
that a{Ti) and /3(Ti) are for all tables the same]. Then, however, starting from Ti, 
the sequence a^ A2Jia^ shows that this is not valid solution. 
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Case 3: For T^ one has [C, c, 7] = [ \ ]; 

In this case, a simple reasoning according to the usual rules fixes the entries: 



T : 



U : 









: 


0' 



: 


"0 0' 



3 

















(25) 



(26) 



Here we have an obvious contradiction in T4/C/4: C3(r4) contains a contradiction, 
but (due to Rule 3) one is not allowed to change the memory state when measuring 
it. Therefore, the memory can never be in the state 4. But then, one would have 
effectively a three-state solution, which is not possible, as we know already. 

Case 4: For T4 one has [C, c, 7] = [ h] .' 

This is the same as Case 3, where R2 and R3 have been interchanged. 

Now we have dealt with all the cases, where T4 contains a contradiction in C3, 
just as Ti. We have seen that in this cases there can only be a solution if each 
table contains exactly one contradiction, and this solution is unique, up to some 
permutations or sign flips. Moreover, we could have made the same discussion 
with rows instead of columns. Therefore from the first four cases we can state an 
observation which will be useful in the remaining four cases: 

Observation 6. // in any four-state solution two tables Ti and Tj have both a 
contradiction in the same column Cfc (or row RfcJ, then there has to be exactly one 
contradiction in each value table of the automaton. 

So, if there is a four-state solution where one table has three contradictions, then 
it cannot be that two tables have both a contradiction in the same column or row. 
Then we can proceed with the remaining cases. 

Case 5: For T4 one has [C, c, 7] = [+ H — ]•■ 

This is the critical case, as it is difficult to distinguish the tables T2 and T4 here. 

First, the following entries are directly fixed: 

+ 



T 



U : 



+ 



0|4 0|4 



0|4 















0|4 















0|2 


0|4 















0|2 



(27) 



(28) 



Here, c{Ui) = 2 has been chosen without loss of generality. It is clear that c{Ui) — 2 
or c{Ui) = 4, as T2 and T4 are equivalent at the beginning, we can choose T2 
here. The entries of the type i\j in U2 and U4 mean that the numbers can be i 
or j, but nothing else. The values of 0(1/2) [and c([/4)] cannot be 1, because then 
the sequence c^JiC'^ directly reveals a contradiction. Furthermore, the zeroes in 
R-3(C^3) and R2(C/3) follow similarly as Eq. (|12bl) or from Rule 3. In addition, 
C{U2) ^ 1, because otherwise the sequence cfC^7^ reveals a contradiction to 
the PM conditions. Also, 0(1/2) 7^ 3, because of CiC2cJ- Similarly, 1 and 3 are 
excluded as values for 0(1/2) and b(U2), due to the sequences c'^a2j^c^ and c^a^c^ . 
Furthermore, we can use our ObservationlHl If in a four-state solution one column 
has a contradiction in two of the Ti, then there can be only one contradiction in 
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any Tj. Here we can use it as follows: It is clear that T3 has its contradiction 
in Ri. Since we aim to rule out a four-state solution where one table has three 
contradictions, we can assume that there is no contradiction in Ri in all the other 
Ti (especially in T2 and T3). Otherwise, we would already know that no solution 
exists with three contradictions in a table. We can distinguish two cases: 
Case 5A: Let us assume that 7(^/2) = 0. Then, the tables must read: 



T 



U 



0|4 
0|4 



0|4 
0|4 



0|4 




0" 




0|4 









0_ 










0|2 
0|2 



(29) 



(30) 



The new entries in U2 follow from 7(1/2) = in combination with "f{Ti) = "/{T^) ^ 

7(^2). 

Due to Rule 5, the table T2 must have a contradiction in Ci, C2, or Ri. From 
Observation [51 we can assume that it is not in Ri. Due to possible permutations 
of Ci and C2 we further assume without loss of generality that the contradiction 
is in Ci. Then we have: 



U: 



1 

4 

0|4 



0|4 
0|4 



0|4 




0" 




0|4 


7 





7 


0_ 










0|2 
0|2 



(31) 



We cannot have A{U2) = 3, since there is a contradiction in Ri(r3) and C{Ti) = + 
for all tables. In addition, due to Rule 5, it is not possible that A{U2) = 4. Finally, 
we choose a{U2) = 4, the other option would be a{U2) = 4, this will be discussed 
below. 

From Observation[n]we can conclude that Ci(ri) and Ci(T4) do not contain con- 
tradictions, since Ci(T2) contains already a contradiction. So Ci(ri) and CilT^) 
must differ in two places (Rule 5). One of these places must be A{Ti) ^ A{T4). Let 
us assume that the second one is a(Ti) ^ a{T4), the other case [a{Ti) ^ Q:(T4)] will 
be discussed below. Then, we can conclude that in Ri(C/i) and C\{JJ\) we cannot 
have the entries "2" and "4", and in R2(C/4) and C\{Ua) we cannot have the entries 
"2" and "1". To see this, note that we must have ^(Ta) = ^(Ti) ^ Ai^^) and, if 
B{Ui) = 2, we can consider the measurement sequence ^2^102^4 or, if B{Ui) — 4, 
the sequence A2B1A4 etc. Hence, we have: 



U: 





0|3 
0|3 







1 




0|4 








0" 




4 


0|4 


0|4 













7 


0|4 


0|4 


0_ 
















0|3 

0|3 0|3 

0|3 





0|2 



Here, we used in Ri(L/i) that Ri(r3) has a contradiction and C{Ti) 
tables, so it is not possible to go there. 

Now, by Rule 1, we may fix A{T2) = ^(Ta) = +. Then we arrive at 



T 



U 



( 

0|3 


) 
2 




1 

4 


0|4 


0|3 


3 




0|4 


0|4 



0|4 




0' 




0|4 


7 





7 


0_ 










0|3 



0|3 







0|2 



0|2 



for all 



(32) 



(33) 



Here, we must have A{T/i) = a{T4) since Ci(T4) has no contradiction. Further- 
more, Ri(r4) has no contradiction due to Observation [51 The values of R2(f/4) 
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are determined by considering sequences like €4040?; and C{Ui) ^ 3, because of 
c^CiC^ , and C(C/4) 7^ 1, because of c^Ci^ic^ . 

In addition, we can conclude that A{U/C) = and B{U/C) — 0, since Ri(T3) has a 
contradiction and C{Ti) = + for all tables, so it is not possible to go there. Then 
we can fill T4 completely. Then, also C(C/4) = 0, otherwise the sequence B^C^B^ 
gives a contradiction. If we had a{U4) = 3, then we must have ^(24) = ^(Ts) = — 
and, consequently (Rule 5) B{U'i) == 1 or 2, but then the sequence A^ajB^A^.^ 
leads to a contradiction, so a{Ui) = 0. In summary, we have: 



T : 



(34) 



U 














1 




0|4 






















0|3 




2 


5 


4 


0|4 


0|4 













: 











0|3 




3 




0|4 


0|4 























0|2 



(35) 



Now Ti is the only candidate for a table with three contradictions. In order to obey 
Observation ini the only possibilities for contradictions are C2, C3, and R2, since 
T4 has its contradiction in R3. Especially, there must be a contradiction in C2(Ti). 
Then, in order to obey Rule 5, we must have: 



T: 



(36) 



U 














1 




0|4 




4 


1|2 
















0|3 


2|3 


2 




4 


0|4 


0|4 


, 











, 











0|3 


2|3 


3 




0|4 


0|4 























0|2 



(37) 



However, if b{Ui) = 3, then the sequence B^biA^B^ leads to a contradiction, 
while, if f3{Ui) — 3, then the sequence B^ liiA^B^ leads to a problem. 

Finally, if we would have taken a{U2) = 4 or a{Ti) ^ ol{Ta) the proof would 
proceed along the same lines, but this time the contradiction in T4 would be in the 
second row. 

Case 5B: Let us assume that 7(C/2) = 4. Then, many entries on f/4 are fixed 
and we have: 



T 



(38) 



U 







0|4 








0' 




0|4 


0|4 


0|4 


5 











: 






4 
















0|2 
0|2 



0|2 
0|2 



0|2 
0|2 
0|2 



(39) 



Here we cannot have aiJJ^) = 1, due the sequences 02720401 [if c{U-2) — 0] or 
C2a4Ci [if 0(^/2) ~ 4], and also not a{lJ^ — 3, due to similar sequences. The same 
arguments apply to b(U4). The entries in R3([/4) and C3(4) come from possible 
sequences like 7401477 if 7(1/4) = or 7472047? if 7(1/4) = 2. 

But then the proof can proceed exactly as in the Case 5A, with T2 and T4 
interchanged: The only significant difference comes from c{Ui) = 2^4, but this 
was never used in the proof. 



Case 6: For T4 one has [C, c, 7] = [^ \-]: 

This is the same as the Case 5 with a permutation of R2 and R3 . 
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Case 7: For T4 one has [C, c, 7] = [— 
In this case, the tables read: 



+]•■ 



U 



























0|4 


0|4 





, 











7 


0|2 


0|2 













0|4 


0|4 







0|3 


0|3 



(40) 



(41) 



Here, the entries in Ui have been chosen without loss of generality: From Rule 4 and 
5 it follows that one can restrict the attention to the cases where C3([/i) — [ , 2, 3], 
C3(C/i) — [ ,2,4], or C3([/i) = [ ,4,3]. We only consider the first possibility, in 
the other cases the proof is analogous and is left to the gentle reader as an exercise. 
The zeroes in U2, U3, and C/4 come from Rule 3. The entries o|2 in C/4 come from 
possible measurement sequences like 040403 or 04047103 which prove that there 
cannot be the entries "3" or "1" . The other entries can be derived accordingly. 

From Rule 5, it follows that in T4 the contradiction cannot be in the rows, so it 
has to be in the first or second column. Let us assume, without loss of generality, 
that it is in Ci{T4). Further, we can assume without loss of generality, that the 
values A and a in T4 are both "+" . Then, the tables can be more specified as 



T 



U 



+ 




"+ 




+' 




"+ 


+" 




"+ 


- 


-' 


+ 




+ 


+ 


+ 




- 


+ - 




+ 


+ 


+ 


+ 




_+ 


- 


- 




_ 


- +_ 




_ 


- 


+_ 


- 




0|1 




0' 




0|1 


0' 




"0 





0" 


2 


7 





0|4 





; 










2 


0|2 





3 


















0|4 




3 


0|3 






(42) 



(43) 



To see this, one first fills T4, then, together with the entries of Ci(C/4), many values 
of T2 and T3 are fixed. The entries o|i are justified similar to the reasoning above. 

In T2 as well as in T3 the contradiction has to be either in Ri or C2. However, 
there cannot be a contradiction in Ri. To see this, assume that there were a con- 
tradiction in Ri(T2). Then, starting from T2 we may measure the sequence C2A2B 
or C2B2A. According to Rule 5, we must end in two different T^. But the memory 
state can never change to T4 [because C(T4) = — ]. So we must have B{U2) = 3, 
but this will not escape the contradiction, since the values for A and C coincide in 
T2 and Ty,. So there is only Ti left, and we arrive at a contradiction. 

Consequently, the contradictions have to be both in C2(T2) and C2(T3). In 
principle, our Observation [5] implies already that we cannot find a solution with 
three contradictions in one table. But one can also directly prove that there is no 
solution at all. Wc have: 



T . 



U 



+ 



+ + 
- + 



+ 



0|1 


1 0' 







4 













0|1 


1 


0' 


















4 














0' 


2 


0|2 





3 


0|3 






(44) 



(45) 



Here, we must have B{Ti) — B(T2) — B{T^) = + due to measurement sequences 
like B+B+ or B+ B+ and p{Ti) = p{T2) due to P^ Bt Pi and 6(Ti) = b{Tz) due to 
b^B^bf . But then, starting from T2, the sequence /3;7-^2^^i^ reveals a contradiction 
to the PM conditions. 
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Case 8: For T^ one has [C, c, 7] 
In this case, we directly have: 



T: 



U : 


+' 
+ 
+ 


2 
3 




- 


+" 
+ 

0" 




1 


+' 
+ 

0" 
0, 





"0 


0' 




- 



(46) 



(47) 



Starting from T2 we may measure the sequence C2A2B or C2B2A. According to 
Rule 5, we must end in two different T^. But the memory state can neither change 
to T4 [because C(T4) = — ] nor to T3 [as Ri(T3) contains a contradiction]. So there 
is only Ti left, and we arrive at a contradiction. 

In summary, by considering all eight different cases we have shown that no four- 
state solution exists in which one table has three contradictions. This proves the 
claim. 



Appendix F. A 10-state automaton obeying all sequences 

In this Appendix we show an example of a lO-state automaton that obeys the 
set of all sequences ^^u- For that, we define 10 eigenstates of two compatible ob- 
servables. We let |y4.~_B+) be a quantum state with A\A^ B'^) — — |A^i3+) and 
B\A-B+) = +\A-B+). In this fashion we define the 10 states \A+B+), \A-B+), 
\C+c+), \C-c+), I7+/3+), |7",S+), \a+a+), \a-a+), \a+b+), and \B+b+). Any mea- 
surement of an observable from the PM square projects with finite probability any 
state of the set onto another state of the set. If e.g. the automaton is in state 
\A^B^) and we measure c, QM predicts a chance of 50% to get the outcome -f 1 
yielding the state \C~c'^), and a 50% chance to obtain —1 and the state \C~c~). 
The former state is in the set of the 10 states and hence our automaton would 
return -|-1 and change to the state |C^c+). We furthermore define that, if both 
states predicted by QM are in the set of the 10 states, then we prefer the state 
corresponding to the output of -1-1. Together with an arbitrary choice of the ini- 
tial state, this completes the definition of the automaton. By construction, this 
automaton is deterministic and obeys .^aii- 
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